Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 36275 |
| Missing cells | 12929 |
| Missing cells (%) | 1.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 5.3 MiB |
| Average record size in memory | 152.0 B |
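The derived figures in this table can be checked against the raw counts. The sketch below assumes that "Missing cells (%)" is taken over all cells (variables × observations) and that the total size is roughly the average record size times the number of rows; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 19
n_observations = 36275
missing_cells = 12929
avg_record_bytes = 152.0

# Assumption: the percentage is computed over every cell (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size is about average record size x number of rows.
total_mib = avg_record_bytes * n_observations / 2**20

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Approximate size in memory: {total_mib:.1f} MiB")
```

Under these assumptions both values round to the figures reported in the table.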
Variable types
| Text | 1 |
|---|---|
| Categorical | 8 |
| Numeric | 10 |
Alerts
| Alert | Type |
|---|---|
| no_of_adults is highly imbalanced (52.4%) | Imbalance |
| required_car_parking_space is highly imbalanced (80.2%) | Imbalance |
| room_type_reserved is highly imbalanced (62.6%) | Imbalance |
| repeated_guest is highly imbalanced (82.8%) | Imbalance |
| no_of_adults has 413 (1.1%) missing values | Missing |
| no_of_weekend_nights has 367 (1.0%) missing values | Missing |
| no_of_week_nights has 807 (2.2%) missing values | Missing |
| type_of_meal_plan has 526 (1.5%) missing values | Missing |
| required_car_parking_space has 2592 (7.1%) missing values | Missing |
| room_type_reserved has 1171 (3.2%) missing values | Missing |
| lead_time has 472 (1.3%) missing values | Missing |
| arrival_year has 378 (1.0%) missing values | Missing |
| arrival_month has 504 (1.4%) missing values | Missing |
| arrival_date has 981 (2.7%) missing values | Missing |
| market_segment_type has 1512 (4.2%) missing values | Missing |
| repeated_guest has 586 (1.6%) missing values | Missing |
| no_of_previous_cancellations has 497 (1.4%) missing values | Missing |
| no_of_previous_bookings_not_canceled has 550 (1.5%) missing values | Missing |
| avg_price_per_room has 460 (1.3%) missing values | Missing |
| no_of_special_requests has 789 (2.2%) missing values | Missing |
| no_of_previous_cancellations is highly skewed (γ1 = 25.03312517) | Skewed |
| Booking_ID has unique values | Unique |
| no_of_children has 33275 (91.7%) zeros | Zeros |
| no_of_weekend_nights has 16715 (46.1%) zeros | Zeros |
| no_of_week_nights has 2327 (6.4%) zeros | Zeros |
| lead_time has 1277 (3.5%) zeros | Zeros |
| no_of_previous_cancellations has 35441 (97.7%) zeros | Zeros |
| no_of_previous_bookings_not_canceled has 34923 (96.3%) zeros | Zeros |
| avg_price_per_room has 539 (1.5%) zeros | Zeros |
| no_of_special_requests has 19350 (53.3%) zeros | Zeros |
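The report does not say how the imbalance percentages are computed. One definition that reproduces the figures above is one minus the normalized Shannon entropy of the class shares; the sketch below is written under that assumption, and the helper name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (in bits) of the class shares.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Non-missing value counts copied from the per-variable "Common Values" tables.
no_of_adults_counts = [25813, 7606, 2290, 137, 16]
repeated_guest_counts = [34773, 916]

adults_score = imbalance_score(no_of_adults_counts)
guest_score = imbalance_score(repeated_guest_counts)
print(f"no_of_adults: {adults_score:.1%}")
print(f"repeated_guest: {guest_score:.1%}")
```

With these counts the scores come out close to the 52.4% and 82.8% listed in the alerts, which suggests, but does not prove, that the report uses a score of this kind.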
Reproduction
| Analysis started | 2024-05-10 12:17:12.115002 |
|---|---|
| Analysis finished | 2024-05-10 12:17:17.517100 |
| Duration | 5.4 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
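The reported duration follows directly from the two timestamps. A minimal check using only the standard library (the format string is an assumption about how the timestamps are laid out):

```python
from datetime import datetime

# Assumed timestamp layout: date, time, microseconds.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-05-10 12:17:12.115002", FMT)
finished = datetime.strptime("2024-05-10 12:17:17.517100", FMT)

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.1f} seconds")
```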
Booking_ID
Text
UNIQUE 
| Distinct | 36275 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 283.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 290200 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 36275 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | INN00001 |
|---|---|
| 2nd row | INN00002 |
| 3rd row | INN00003 |
| 4th row | INN00004 |
| 5th row | INN00005 |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| inn00001 | 1 | < 0.1% |
| inn00007 | 1 | < 0.1% |
| inn00070 | 1 | < 0.1% |
| inn00009 | 1 | < 0.1% |
| inn00003 | 1 | < 0.1% |
| inn00004 | 1 | < 0.1% |
| inn00005 | 1 | < 0.1% |
| inn00006 | 1 | < 0.1% |
| inn00008 | 1 | < 0.1% |
| inn00020 | 1 | < 0.1% |
| Other values (36265) | 36265 | > 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| N | 72550 | 25.0% |
| I | 36275 | 12.5% |
| 1 | 24958 | 8.6% |
| 0 | 24953 | 8.6% |
| 2 | 24934 | 8.6% |
| 3 | 21134 | 7.3% |
| 4 | 14858 | 5.1% |
| 5 | 14858 | 5.1% |
| 6 | 14133 | 4.9% |
| 7 | 13853 | 4.8% |
| Other values (2) | 27694 | 9.5% |
no_of_adults
Categorical
IMBALANCE  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 413 |
| Missing (%) | 1.1% |
| Memory size | 283.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 107586 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 2.0 |
| 4th row | 2.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.0 | 25813 | 71.2% |
| 1.0 | 7606 | 21.0% |
| 3.0 | 2290 | 6.3% |
| 0.0 | 137 | 0.4% |
| 4.0 | 16 | < 0.1% |
| (Missing) | 413 | 1.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 2.0 | 25813 | 72.0% |
| 1.0 | 7606 | 21.2% |
| 3.0 | 2290 | 6.4% |
| 0.0 | 137 | 0.4% |
| 4.0 | 16 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 35999 | 33.5% |
| . | 35862 | 33.3% |
| 2 | 25813 | 24.0% |
| 1 | 7606 | 7.1% |
| 3 | 2290 | 2.1% |
| 4 | 16 | < 0.1% |
no_of_children
Real number (ℝ)
ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 324 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.10536564 |
| Minimum | 0 |
| Maximum | 10 |
| Zeros | 33275 |
| Zeros (%) | 91.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4028713 |
|---|---|
| Coefficient of variation (CV) | 3.8235549 |
| Kurtosis | 37.117523 |
| Mean | 0.10536564 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.7144081 |
| Sum | 3788 |
| Variance | 0.16230528 |
| Monotonicity | Not monotonic |
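Several of the statistics above are simple functions of one another. The sketch below recomputes a few of them for no_of_children from figures given in this report, assuming the mean is taken over non-missing rows and the coefficient of variation is the standard deviation divided by the mean; the variable names are illustrative.

```python
# Figures copied from the no_of_children tables.
n_observations = 36275   # rows in the dataset
n_missing = 324          # "Missing"
total = 3788             # "Sum"
std = 0.4028713          # "Standard deviation"
zeros = 33275            # "Zeros"

# Assumption: statistics are computed over non-missing rows only.
n_valid = n_observations - n_missing
mean = total / n_valid

cv = std / mean                             # coefficient of variation
variance = std ** 2                         # variance is the squared standard deviation
zeros_pct = 100 * zeros / n_observations    # zeros as a share of all rows

print(f"mean={mean:.8f} cv={cv:.7f} variance={variance:.8f} zeros={zeros_pct:.1f}%")
```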
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 33275 | 91.7% |
| 1 | 1605 | 4.4% |
| 2 | 1049 | 2.9% |
| 3 | 19 | 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| (Missing) | 324 | 0.9% |
Extreme Values (minimum)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 33275 | 91.7% |
| 1 | 1605 | 4.4% |
| 2 | 1049 | 2.9% |
| 3 | 19 | 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
Extreme Values (maximum)
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 3 | 19 | 0.1% |
| 2 | 1049 | 2.9% |
| 1 | 1605 | 4.4% |
| 0 | 33275 | 91.7% |
no_of_weekend_nights
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 367 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.81020942 |
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 16715 |
| Zeros (%) | 46.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.87085698 |
|---|---|
| Coefficient of variation (CV) | 1.0748542 |
| Kurtosis | 0.31186403 |
| Mean | 0.81020942 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.74090195 |
| Sum | 29093 |
| Variance | 0.75839188 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 16715 | 46.1% |
| 1 | 9888 | 27.3% |
| 2 | 8970 | 24.7% |
| 3 | 152 | 0.4% |
| 4 | 128 | 0.4% |
| 5 | 34 | 0.1% |
| 6 | 20 | 0.1% |
| 7 | 1 | < 0.1% |
| (Missing) | 367 | 1.0% |
Extreme Values (minimum)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 16715 | 46.1% |
| 1 | 9888 | 27.3% |
| 2 | 8970 | 24.7% |
| 3 | 152 | 0.4% |
| 4 | 128 | 0.4% |
| 5 | 34 | 0.1% |
| 6 | 20 | 0.1% |
| 7 | 1 | < 0.1% |
Extreme Values (maximum)
| Value | Count | Frequency (%) |
|---|---|---|
| 7 | 1 | < 0.1% |
| 6 | 20 | 0.1% |
| 5 | 34 | 0.1% |
| 4 | 128 | 0.4% |
| 3 | 152 | 0.4% |
| 2 | 8970 | 24.7% |
| 1 | 9888 | 27.3% |
| 0 | 16715 | 46.1% |
no_of_week_nights
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 807 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.20331 |
| Minimum | 0 |
| Maximum | 17 |
| Zeros | 2327 |
| Zeros (%) | 6.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4098899 |
|---|---|
| Coefficient of variation (CV) | 0.63989629 |
| Kurtosis | 7.8816893 |
| Mean | 2.20331 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.604805 |
| Sum | 78147 |
| Variance | 1.9877895 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 11191 | 30.9% |
| 1 | 9295 | 25.6% |
| 3 | 7660 | 21.1% |
| 4 | 2914 | 8.0% |
| 0 | 2327 | 6.4% |
| 5 | 1584 | 4.4% |
| 6 | 184 | 0.5% |
| 7 | 109 | 0.3% |
| 8 | 61 | 0.2% |
| 10 | 58 | 0.2% |
| Other values (8) | 85 | 0.2% |
| (Missing) | 807 | 2.2% |
Extreme Values (minimum)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2327 | 6.4% |
| 1 | 9295 | 25.6% |
| 2 | 11191 | 30.9% |
| 3 | 7660 | 21.1% |
| 4 | 2914 | 8.0% |
| 5 | 1584 | 4.4% |
| 6 | 184 | 0.5% |
| 7 | 109 | 0.3% |
| 8 | 61 | 0.2% |
| 9 | 32 | 0.1% |
Extreme Values (maximum)
| Value | Count | Frequency (%) |
|---|---|---|
| 17 | 3 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 10 | < 0.1% |
| 14 | 7 | < 0.1% |
| 13 | 5 | < 0.1% |
| 12 | 9 | < 0.1% |
| 11 | 17 | < 0.1% |
| 10 | 58 | 0.2% |
| 9 | 32 | 0.1% |
| 8 | 61 | 0.2% |
type_of_meal_plan
Categorical
MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 526 |
| Missing (%) | 1.5% |
| Memory size | 283.5 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 11.141459 |
| Min length | 11 |
Characters and Unicode
| Total characters | 398296 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not Selected |
|---|---|
| 2nd row | Meal Plan 1 |
| 3rd row | Meal Plan 1 |
| 4th row | Not Selected |
| 5th row | Meal Plan 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Meal Plan 1 | 27421 | 75.6% |
| Not Selected | 5057 | 13.9% |
| Meal Plan 2 | 3266 | 9.0% |
| Meal Plan 3 | 5 | < 0.1% |
| (Missing) | 526 | 1.5% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| meal | 30692 | 30.0% |
| plan | 30692 | 30.0% |
| 1 | 27421 | 26.8% |
| not | 5057 | 4.9% |
| selected | 5057 | 4.9% |
| 2 | 3266 | 3.2% |
| 3 | 5 | < 0.1% |
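The lower-cased token counts listed just above (meal, plan, 1, not, selected, ...) can be rebuilt from the category counts. The sketch below assumes tokens are produced by lower-casing each value and splitting on whitespace, which is consistent with the numbers shown but is not stated in the report.

```python
from collections import Counter

# Non-missing value counts from the type_of_meal_plan "Common Values" table.
value_counts = {
    "Meal Plan 1": 27421,
    "Not Selected": 5057,
    "Meal Plan 2": 3266,
    "Meal Plan 3": 5,
}

word_counts = Counter()
for value, count in value_counts.items():
    # Assumption: words are lower-cased and split on whitespace.
    for word in value.lower().split():
        word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word:10s} {count:6d} {count / total_words:6.1%}")
```

The percentages in that table are therefore shares of all tokens, not shares of rows, which is why "not" appears at 4.9% even though "Not Selected" covers 13.9% of rows.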
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| l | 66441 | 16.7% |
| (space) | 66441 | 16.7% |
| a | 61384 | 15.4% |
| e | 45863 | 11.5% |
| M | 30692 | 7.7% |
| P | 30692 | 7.7% |
| n | 30692 | 7.7% |
| 1 | 27421 | 6.9% |
| t | 10114 | 2.5% |
| N | 5057 | 1.3% |
| Other values (6) | 23499 | 5.9% |
required_car_parking_space
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2592 |
| Missing (%) | 7.1% |
| Memory size | 283.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 101049 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 32649 | 90.0% |
| 1.0 | 1034 | 2.9% |
| (Missing) | 2592 | 7.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 32649 | 96.9% |
| 1.0 | 1034 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 66332 | 65.6% |
| . | 33683 | 33.3% |
| 1 | 1034 | 1.0% |
room_type_reserved
Categorical
IMBALANCE  MISSING 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1171 |
| Missing (%) | 3.2% |
| Memory size | 283.5 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Characters and Unicode
| Total characters | 386144 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Room_Type 1 |
|---|---|
| 2nd row | Room_Type 1 |
| 3rd row | Room_Type 1 |
| 4th row | Room_Type 1 |
| 5th row | Room_Type 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Room_Type 1 | 27234 | 75.1% |
| Room_Type 4 | 5851 | 16.1% |
| Room_Type 6 | 939 | 2.6% |
| Room_Type 2 | 664 | 1.8% |
| Room_Type 5 | 256 | 0.7% |
| Room_Type 7 | 154 | 0.4% |
| Room_Type 3 | 6 | < 0.1% |
| (Missing) | 1171 | 3.2% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| room_type | 35104 | 50.0% |
| 1 | 27234 | 38.8% |
| 4 | 5851 | 8.3% |
| 6 | 939 | 1.3% |
| 2 | 664 | 0.9% |
| 5 | 256 | 0.4% |
| 7 | 154 | 0.2% |
| 3 | 6 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| o | 70208 | 18.2% |
| R | 35104 | 9.1% |
| m | 35104 | 9.1% |
| _ | 35104 | 9.1% |
| T | 35104 | 9.1% |
| y | 35104 | 9.1% |
| p | 35104 | 9.1% |
| e | 35104 | 9.1% |
| (space) | 35104 | 9.1% |
| 1 | 27234 | 7.1% |
| Other values (6) | 7870 | 2.0% |
lead_time
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 352 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 472 |
| Missing (%) | 1.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 85.276569 |
| Minimum | 0 |
| Maximum | 443 |
| Zeros | 1277 |
| Zeros (%) | 3.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 17 |
| median | 57 |
| Q3 | 126 |
| 95-th percentile | 273 |
| Maximum | 443 |
| Range | 443 |
| Interquartile range (IQR) | 109 |
Descriptive statistics
| Standard deviation | 85.998845 |
|---|---|
| Coefficient of variation (CV) | 1.0084698 |
| Kurtosis | 1.1780846 |
| Mean | 85.276569 |
| Median Absolute Deviation (MAD) | 47 |
| Skewness | 1.29236 |
| Sum | 3053157 |
| Variance | 7395.8013 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1277 | 3.5% |
| 1 | 1068 | 2.9% |
| 2 | 631 | 1.7% |
| 3 | 622 | 1.7% |
| 4 | 620 | 1.7% |
| 5 | 574 | 1.6% |
| 6 | 517 | 1.4% |
| 8 | 433 | 1.2% |
| 7 | 421 | 1.2% |
| 12 | 404 | 1.1% |
| Other values (342) | 29236 | 80.6% |
| (Missing) | 472 | 1.3% |
Extreme Values (minimum)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1277 | 3.5% |
| 1 | 1068 | 2.9% |
| 2 | 631 | 1.7% |
| 3 | 622 | 1.7% |
| 4 | 620 | 1.7% |
| 5 | 574 | 1.6% |
| 6 | 517 | 1.4% |
| 7 | 421 | 1.2% |
| 8 | 433 | 1.2% |
| 9 | 330 | 0.9% |
Extreme Values (maximum)
| Value | Count | Frequency (%) |
|---|---|---|
| 443 | 22 | 0.1% |
| 433 | 20 | 0.1% |
| 418 | 59 | 0.2% |
| 386 | 69 | 0.2% |
| 381 | 2 | < 0.1% |
| 377 | 68 | 0.2% |
| 372 | 1 | < 0.1% |
| 361 | 5 | < 0.1% |
| 359 | 16 | < 0.1% |
| 355 | 1 | < 0.1% |
arrival_year
Categorical
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 378 |
| Missing (%) | 1.0% |
| Memory size | 283.5 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 215382 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2018.0 |
|---|---|
| 2nd row | 2018.0 |
| 3rd row | 2018.0 |
| 4th row | 2018.0 |
| 5th row | 2018.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2018.0 | 29451 | 81.2% |
| 2017.0 | 6446 | 17.8% |
| (Missing) | 378 | 1.0% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 2018.0 | 29451 | 82.0% |
| 2017.0 | 6446 | 18.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 71794 | 33.3% |
| 2 | 35897 | 16.7% |
| 1 | 35897 | 16.7% |
| . | 35897 | 16.7% |
| 8 | 29451 | 13.7% |
| 7 | 6446 | 3.0% |
arrival_month
Real number (ℝ)
MISSING 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 504 |
| Missing (%) | 1.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.4240306 |
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.0682767 |
|---|---|
| Coefficient of variation (CV) | 0.41328988 |
| Kurtosis | -0.93196042 |
| Mean | 7.4240306 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.34793619 |
| Sum | 265565 |
| Variance | 9.4143221 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 5238 | 14.4% |
| 9 | 4550 | 12.5% |
| 8 | 3761 | 10.4% |
| 6 | 3162 | 8.7% |
| 12 | 2977 | 8.2% |
| 11 | 2937 | 8.1% |
| 7 | 2887 | 8.0% |
| 4 | 2700 | 7.4% |
| 5 | 2563 | 7.1% |
| 3 | 2328 | 6.4% |
| Other values (2) | 2668 | 7.4% |
Extreme Values (minimum)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1000 | 2.8% |
| 2 | 1668 | 4.6% |
| 3 | 2328 | 6.4% |
| 4 | 2700 | 7.4% |
| 5 | 2563 | 7.1% |
| 6 | 3162 | 8.7% |
| 7 | 2887 | 8.0% |
| 8 | 3761 | 10.4% |
| 9 | 4550 | 12.5% |
| 10 | 5238 | 14.4% |
Extreme Values (maximum)
| Value | Count | Frequency (%) |
|---|---|---|
| 12 | 2977 | 8.2% |
| 11 | 2937 | 8.1% |
| 10 | 5238 | 14.4% |
| 9 | 4550 | 12.5% |
| 8 | 3761 | 10.4% |
| 7 | 2887 | 8.0% |
| 6 | 3162 | 8.7% |
| 5 | 2563 | 7.1% |
| 4 | 2700 | 7.4% |
| 3 | 2328 | 6.4% |
arrival_date
Real number (ℝ)
MISSING 
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 981 |
| Missing (%) | 2.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.605712 |
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.7434836 |
|---|---|
| Coefficient of variation (CV) | 0.56027457 |
| Kurtosis | -1.157733 |
| Mean | 15.605712 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.027554339 |
| Sum | 550788 |
| Variance | 76.448505 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 13 | 1321 | 3.6% |
| 17 | 1314 | 3.6% |
| 2 | 1292 | 3.6% |
| 4 | 1287 | 3.5% |
| 19 | 1286 | 3.5% |
| 16 | 1270 | 3.5% |
| 20 | 1243 | 3.4% |
| 15 | 1238 | 3.4% |
| 18 | 1232 | 3.4% |
| 6 | 1231 | 3.4% |
| Other values (21) | 22580 | 62.2% |
Extreme Values (minimum)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1104 | 3.0% |
| 2 | 1292 | 3.6% |
| 3 | 1076 | 3.0% |
| 4 | 1287 | 3.5% |
| 5 | 1119 | 3.1% |
| 6 | 1231 | 3.4% |
| 7 | 1077 | 3.0% |
| 8 | 1166 | 3.2% |
| 9 | 1103 | 3.0% |
| 10 | 1062 | 2.9% |
Extreme Values (maximum)
| Value | Count | Frequency (%) |
|---|---|---|
| 31 | 565 | 1.6% |
| 30 | 1178 | 3.2% |
| 29 | 1167 | 3.2% |
| 28 | 1109 | 3.1% |
| 27 | 1031 | 2.8% |
| 26 | 1117 | 3.1% |
| 25 | 1111 | 3.1% |
| 24 | 1074 | 3.0% |
| 23 | 961 | 2.6% |
| 22 | 997 | 2.7% |
market_segment_type
Categorical
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1512 |
| Missing (%) | 4.2% |
| Memory size | 283.5 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.5385899 |
| Min length | 6 |
Characters and Unicode
| Total characters | 227301 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Online |
|---|---|
| 2nd row | Online |
| 3rd row | Online |
| 4th row | Online |
| 5th row | Online |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Online | 22264 | 61.4% |
| Offline | 10076 | 27.8% |
| Corporate | 1926 | 5.3% |
| Complementary | 375 | 1.0% |
| Aviation | 122 | 0.3% |
| (Missing) | 1512 | 4.2% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| online | 22264 | 64.0% |
| offline | 10076 | 29.0% |
| corporate | 1926 | 5.5% |
| complementary | 375 | 1.1% |
| aviation | 122 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| n | 55101 | 24.2% |
| e | 35016 | 15.4% |
| l | 32715 | 14.4% |
| i | 32584 | 14.3% |
| O | 32340 | 14.2% |
| f | 20152 | 8.9% |
| o | 4349 | 1.9% |
| r | 4227 | 1.9% |
| a | 2423 | 1.1% |
| t | 2423 | 1.1% |
| Other values (6) | 5971 | 2.6% |
repeated_guest
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 586 |
| Missing (%) | 1.6% |
| Memory size | 283.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 107067 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 34773 | 95.9% |
| 1.0 | 916 | 2.5% |
| (Missing) | 586 | 1.6% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 34773 | 97.4% |
| 1.0 | 916 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 70462 | 65.8% |
| . | 35689 | 33.3% |
| 1 | 916 | 0.9% |
no_of_previous_cancellations
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 497 |
| Missing (%) | 1.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.023645816 |
| Minimum | 0 |
| Maximum | 13 |
| Zeros | 35441 |
| Zeros (%) | 97.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.37083473 |
|---|---|
| Coefficient of variation (CV) | 15.68289 |
| Kurtosis | 722.93754 |
| Mean | 0.023645816 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 25.033125 |
| Sum | 846 |
| Variance | 0.1375184 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 35441 | 97.7% |
| 1 | 197 | 0.5% |
| 2 | 46 | 0.1% |
| 3 | 43 | 0.1% |
| 11 | 25 | 0.1% |
| 5 | 11 | < 0.1% |
| 4 | 10 | < 0.1% |
| 13 | 4 | < 0.1% |
| 6 | 1 | < 0.1% |
| (Missing) | 497 | 1.4% |
Extreme Values (minimum)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 35441 | 97.7% |
| 1 | 197 | 0.5% |
| 2 | 46 | 0.1% |
| 3 | 43 | 0.1% |
| 4 | 10 | < 0.1% |
| 5 | 11 | < 0.1% |
| 6 | 1 | < 0.1% |
| 11 | 25 | 0.1% |
| 13 | 4 | < 0.1% |
Extreme Values (maximum)
| Value | Count | Frequency (%) |
|---|---|---|
| 13 | 4 | < 0.1% |
| 11 | 25 | 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 11 | < 0.1% |
| 4 | 10 | < 0.1% |
| 3 | 43 | 0.1% |
| 2 | 46 | 0.1% |
| 1 | 197 | 0.5% |
| 0 | 35441 | 97.7% |
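Because no_of_previous_cancellations has only nine distinct values, its mean, variance and skewness can be recomputed from the frequency table alone. The sketch below assumes the report uses the sample variance (n - 1 denominator) and the adjusted Fisher-Pearson skewness; both assumptions match the reported figures to the digits shown, but the report itself does not name the estimators.

```python
import math

# Value -> count, copied from the no_of_previous_cancellations frequency table
# (the 497 missing rows are excluded).
freq = {0: 35441, 1: 197, 2: 46, 3: 43, 4: 10, 5: 11, 6: 1, 11: 25, 13: 4}

n = sum(freq.values())
mean = sum(v * c for v, c in freq.items()) / n

# Second and third central moments (population form, divided by n).
m2 = sum(c * (v - mean) ** 2 for v, c in freq.items()) / n
m3 = sum(c * (v - mean) ** 3 for v, c in freq.items()) / n

sample_variance = m2 * n / (n - 1)
g1 = m3 / m2 ** 1.5                                  # biased skewness
adjusted_skew = g1 * math.sqrt(n * (n - 1)) / (n - 2)  # adjusted Fisher-Pearson

print(f"n={n} mean={mean:.9f} variance={sample_variance:.7f} skewness={adjusted_skew:.4f}")
```

The extreme skewness comes from the small group of bookings with 11 or 13 previous cancellations sitting far from a mean that is almost zero.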
no_of_previous_bookings_not_canceled
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 59 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 550 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.15445766 |
| Minimum | 0 |
| Maximum | 58 |
| Zeros | 34923 |
| Zeros (%) | 96.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 58 |
| Range | 58 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.764805 |
|---|---|
| Coefficient of variation (CV) | 11.425817 |
| Kurtosis | 453.13648 |
| Mean | 0.15445766 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 19.175712 |
| Sum | 5518 |
| Variance | 3.1145365 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 34923 | 96.3% |
| 1 | 227 | 0.6% |
| 2 | 109 | 0.3% |
| 3 | 79 | 0.2% |
| 4 | 64 | 0.2% |
| 5 | 59 | 0.2% |
| 6 | 36 | 0.1% |
| 8 | 23 | 0.1% |
| 7 | 22 | 0.1% |
| 10 | 19 | 0.1% |
| Other values (49) | 164 | 0.5% |
| (Missing) | 550 | 1.5% |
Extreme Values (minimum)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 34923 | 96.3% |
| 1 | 227 | 0.6% |
| 2 | 109 | 0.3% |
| 3 | 79 | 0.2% |
| 4 | 64 | 0.2% |
| 5 | 59 | 0.2% |
| 6 | 36 | 0.1% |
| 7 | 22 | 0.1% |
| 8 | 23 | 0.1% |
| 9 | 19 | 0.1% |
Extreme Values (maximum)
| Value | Count | Frequency (%) |
|---|---|---|
| 58 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 55 | 1 | < 0.1% |
| 54 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| 52 | 1 | < 0.1% |
| 51 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 49 | 1 | < 0.1% |
avg_price_per_room
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 3905 |
|---|---|
| Distinct (%) | 10.9% |
| Missing | 460 |
| Missing (%) | 1.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 103.41821 |
| Minimum | 0 |
| Maximum | 540 |
| Zeros | 539 |
| Zeros (%) | 1.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 61 |
| Q1 | 80.3 |
| median | 99.45 |
| Q3 | 120 |
| 95-th percentile | 164.9 |
| Maximum | 540 |
| Range | 540 |
| Interquartile range (IQR) | 39.7 |
Descriptive statistics
| Standard deviation | 35.057342 |
|---|---|
| Coefficient of variation (CV) | 0.33898617 |
| Kurtosis | 3.1192587 |
| Mean | 103.41821 |
| Median Absolute Deviation (MAD) | 20.25 |
| Skewness | 0.65625004 |
| Sum | 3703923.1 |
| Variance | 1229.0172 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 65 | 840 | 2.3% |
| 75 | 815 | 2.2% |
| 90 | 693 | 1.9% |
| 115 | 658 | 1.8% |
| 95 | 658 | 1.8% |
| 120 | 604 | 1.7% |
| 100 | 598 | 1.6% |
| 110 | 553 | 1.5% |
| 0 | 539 | 1.5% |
| 85 | 501 | 1.4% |
| Other values (3895) | 29356 | 80.9% |
| (Missing) | 460 | 1.3% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 539 | 1.5% |
| 0.5 | 1 | < 0.1% |
| 1 | 9 | < 0.1% |
| 1.48 | 1 | < 0.1% |
| 1.6 | 1 | < 0.1% |
| 2 | 6 | < 0.1% |
| 3 | 3 | < 0.1% |
| 4.5 | 1 | < 0.1% |
| 6 | 25 | 0.1% |
| 6.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 540 | 1 | < 0.1% |
| 375.5 | 1 | < 0.1% |
| 365 | 1 | < 0.1% |
| 349.63 | 1 | < 0.1% |
| 316 | 1 | < 0.1% |
| 314.1 | 1 | < 0.1% |
| 306 | 2 | < 0.1% |
| 300 | 5 | < 0.1% |
| 299.33 | 1 | < 0.1% |
| 297 | 1 | < 0.1% |
no_of_special_requests
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 789 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.61934284 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 19350 |
| Zeros (%) | 53.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 283.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.78584926 |
|---|---|
| Coefficient of variation (CV) | 1.2688437 |
| Kurtosis | 0.88609123 |
| Mean | 0.61934284 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.1451999 |
| Sum | 21978 |
| Variance | 0.61755906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 19350 | 53.3% |
| 1 | 11125 | 30.7% |
| 2 | 4273 | 11.8% |
| 3 | 653 | 1.8% |
| 4 | 77 | 0.2% |
| 5 | 8 | < 0.1% |
| (Missing) | 789 | 2.2% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 19350 | 53.3% |
| 1 | 11125 | 30.7% |
| 2 | 4273 | 11.8% |
| 3 | 653 | 1.8% |
| 4 | 77 | 0.2% |
| 5 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 8 | < 0.1% |
| 4 | 77 | 0.2% |
| 3 | 653 | 1.8% |
| 2 | 4273 | 11.8% |
| 1 | 11125 | 30.7% |
| 0 | 19350 | 53.3% |
booking_status
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 283.5 KiB |
| Not_Canceled | 24390 |
|---|---|
| Canceled | 11885 |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 10.689456 |
| Min length | 8 |
Characters and Unicode
| Total characters | 387760 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not_Canceled |
|---|---|
| 2nd row | Not_Canceled |
| 3rd row | Canceled |
| 4th row | Canceled |
| 5th row | Canceled |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Not_Canceled | 24390 | 67.2% |
| Canceled | 11885 | 32.8% |
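
The two counts above determine the class balance of booking_status. As a minimal sketch, assuming booking_status is later used as a classification target (an assumption; the report does not say so), the shares and the accuracy of a trivial "always predict the majority class" baseline can be derived as follows. The variable names are illustrative.

```python
# Counts copied from the Common Values table of booking_status.
counts = {"Not_Canceled": 24390, "Canceled": 11885}

total = sum(counts.values())
shares = {label: count / total for label, count in counts.items()}

# A model that always predicts the most frequent label is right
# exactly as often as that label occurs.
majority_label = max(counts, key=counts.get)
baseline_accuracy = shares[majority_label]

for label, share in shares.items():
    print(f"{label}: {share:.1%}")
print(f"Majority baseline ({majority_label}): {baseline_accuracy:.1%}")
```

Any classifier trained on this column would need to beat roughly 67% accuracy to improve on that baseline, which is why accuracy alone can be a misleading metric for a split of this kind.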
Words
| Value | Count | Frequency (%) |
|---|---|---|
| not_canceled | 24390 | 67.2% |
| canceled | 11885 | 32.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 72550 | 18.7% |
| C | 36275 | 9.4% |
| a | 36275 | 9.4% |
| n | 36275 | 9.4% |
| c | 36275 | 9.4% |
| l | 36275 | 9.4% |
| d | 36275 | 9.4% |
| N | 24390 | 6.3% |
| o | 24390 | 6.3% |
| t | 24390 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 387760 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| e | 72550 | 18.7% |
| C | 36275 | 9.4% |
| a | 36275 | 9.4% |
| n | 36275 | 9.4% |
| c | 36275 | 9.4% |
| l | 36275 | 9.4% |
| d | 36275 | 9.4% |
| N | 24390 | 6.3% |
| o | 24390 | 6.3% |
| t | 24390 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 387760 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| e | 72550 | 18.7% |
| C | 36275 | 9.4% |
| a | 36275 | 9.4% |
| n | 36275 | 9.4% |
| c | 36275 | 9.4% |
| l | 36275 | 9.4% |
| d | 36275 | 9.4% |
| N | 24390 | 6.3% |
| o | 24390 | 6.3% |
| t | 24390 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 387760 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| e | 72550 | 18.7% |
| C | 36275 | 9.4% |
| a | 36275 | 9.4% |
| n | 36275 | 9.4% |
| c | 36275 | 9.4% |
| l | 36275 | 9.4% |
| d | 36275 | 9.4% |
| N | 24390 | 6.3% |
| o | 24390 | 6.3% |
| t | 24390 | 6.3% |
| | Booking_ID | no_of_adults | no_of_children | no_of_weekend_nights | no_of_week_nights | type_of_meal_plan | required_car_parking_space | room_type_reserved | lead_time | arrival_year | arrival_month | arrival_date | market_segment_type | repeated_guest | no_of_previous_cancellations | no_of_previous_bookings_not_canceled | avg_price_per_room | no_of_special_requests | booking_status |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | INN00001 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Not_Canceled |
| 1 | INN00002 | 2.0 | 0.0 | 2.0 | 3.0 | Not Selected | 0.0 | Room_Type 1 | 5.0 | 2018.0 | 11.0 | 6.0 | Online | 0.0 | 0.0 | 0.0 | 106.68 | 1.0 | Not_Canceled |
| 2 | INN00003 | 1.0 | 0.0 | 2.0 | 1.0 | Meal Plan 1 | 0.0 | Room_Type 1 | 1.0 | 2018.0 | 2.0 | 28.0 | Online | 0.0 | 0.0 | 0.0 | 60.00 | 0.0 | Canceled |
| 3 | INN00004 | 2.0 | 0.0 | 0.0 | 2.0 | Meal Plan 1 | 0.0 | Room_Type 1 | 211.0 | 2018.0 | 5.0 | 20.0 | Online | 0.0 | 0.0 | 0.0 | 100.00 | 0.0 | Canceled |
| 4 | INN00005 | 2.0 | 0.0 | 1.0 | 1.0 | Not Selected | 0.0 | Room_Type 1 | 48.0 | 2018.0 | 4.0 | 11.0 | Online | 0.0 | 0.0 | 0.0 | 94.50 | 0.0 | Canceled |
| 5 | INN00006 | 2.0 | 0.0 | 0.0 | 2.0 | Meal Plan 2 | 0.0 | Room_Type 1 | 346.0 | 2018.0 | 9.0 | 13.0 | Online | 0.0 | 0.0 | 0.0 | 115.00 | 1.0 | Canceled |
| 6 | INN00007 | 2.0 | 0.0 | 1.0 | 3.0 | Meal Plan 1 | 0.0 | Room_Type 1 | 34.0 | 2017.0 | 10.0 | 15.0 | Online | 0.0 | 0.0 | 0.0 | 107.55 | 1.0 | Not_Canceled |
| 7 | INN00008 | 2.0 | 0.0 | 1.0 | 3.0 | Meal Plan 1 | 0.0 | Room_Type 4 | 83.0 | 2018.0 | 12.0 | 26.0 | Online | 0.0 | 0.0 | 0.0 | 105.61 | 1.0 | Not_Canceled |
| 8 | INN00009 | 3.0 | 0.0 | 0.0 | 4.0 | Meal Plan 1 | 0.0 | Room_Type 1 | 121.0 | 2018.0 | 7.0 | 6.0 | Offline | 0.0 | 0.0 | 0.0 | 96.90 | 1.0 | Not_Canceled |
| 9 | INN00010 | 2.0 | 0.0 | 0.0 | 5.0 | Meal Plan 1 | 0.0 | Room_Type 4 | 44.0 | 2018.0 | 10.0 | 18.0 | Online | 0.0 | 0.0 | 0.0 | 133.44 | 3.0 | Not_Canceled |
| | Booking_ID | no_of_adults | no_of_children | no_of_weekend_nights | no_of_week_nights | type_of_meal_plan | required_car_parking_space | room_type_reserved | lead_time | arrival_year | arrival_month | arrival_date | market_segment_type | repeated_guest | no_of_previous_cancellations | no_of_previous_bookings_not_canceled | avg_price_per_room | no_of_special_requests | booking_status |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 36265 | INN36266 | 2.0 | 0.0 | 1.0 | 3.0 | Meal Plan 1 | 0.0 | Room_Type 1 | 15.0 | 2018.0 | 5.0 | 30.0 | Online | 0.0 | 0.0 | 0.0 | 100.73 | 0.0 | Not_Canceled |
| 36266 | INN36267 | 2.0 | 0.0 | 2.0 | 2.0 | Meal Plan 1 | 0.0 | Room_Type 2 | 8.0 | 2018.0 | 3.0 | 4.0 | Online | 0.0 | 0.0 | 0.0 | 85.96 | 1.0 | Canceled |
| 36267 | INN36268 | 2.0 | 0.0 | 1.0 | 0.0 | Not Selected | 0.0 | Room_Type 1 | NaN | 2018.0 | 7.0 | 11.0 | Online | 0.0 | 0.0 | 0.0 | 93.15 | 0.0 | Canceled |
| 36268 | INN36269 | 1.0 | 0.0 | 0.0 | 3.0 | Meal Plan 1 | 0.0 | Room_Type 1 | 166.0 | 2018.0 | 11.0 | 1.0 | Offline | 0.0 | 0.0 | 0.0 | 110.00 | 0.0 | Canceled |
| 36269 | INN36270 | 2.0 | 2.0 | 0.0 | 1.0 | Meal Plan 1 | 0.0 | Room_Type 6 | 0.0 | 2018.0 | 10.0 | 6.0 | Online | 0.0 | 0.0 | 0.0 | 216.00 | 0.0 | Canceled |
| 36270 | INN36271 | 3.0 | 0.0 | 2.0 | NaN | Meal Plan 1 | 0.0 | NaN | 85.0 | 2018.0 | 8.0 | 3.0 | Online | NaN | 0.0 | 0.0 | 167.80 | 1.0 | Not_Canceled |
| 36271 | INN36272 | 2.0 | 0.0 | 1.0 | 3.0 | Meal Plan 1 | 0.0 | Room_Type 1 | 228.0 | 2018.0 | 10.0 | 17.0 | Online | 0.0 | 0.0 | 0.0 | 90.95 | 2.0 | Canceled |
| 36272 | INN36273 | 2.0 | 0.0 | 2.0 | 6.0 | Meal Plan 1 | 0.0 | Room_Type 1 | 148.0 | 2018.0 | 7.0 | 1.0 | Online | 0.0 | 0.0 | 0.0 | 98.39 | 2.0 | Not_Canceled |
| 36273 | INN36274 | 2.0 | 0.0 | 0.0 | 3.0 | Not Selected | 0.0 | Room_Type 1 | 63.0 | 2018.0 | 4.0 | 21.0 | Online | 0.0 | 0.0 | 0.0 | 94.50 | 0.0 | Canceled |
| 36274 | INN36275 | 2.0 | 0.0 | 1.0 | 2.0 | Meal Plan 1 | NaN | Room_Type 1 | 207.0 | 2018.0 | 12.0 | 30.0 | Offline | 0.0 | 0.0 | 0.0 | 161.67 | 0.0 | Not_Canceled |